Optimal Web cache sizing: scalable methods for exact solutions

نویسندگان

  • T. P. Kelly
  • Douglas S. Reeves
چکیده

This paper describes two approaches to the problem of determining exact optimal storage capacity for Web caches based on expected workload and the monetary costs of memory and bandwidth. The first approach considers memory/bandwidth tradeoffs in an idealized model. It assumes that workload consists of independent references drawn from a known distribution (e.g., Zipf) and caches employ a “Perfect LFU” removal policy. We derive conditions under which a shared higher-level “parent” cache serving several lower-level “child” caches is economically viable. We also characterize circumstances under which globally optimal storage capacities in such a hierarchy can be determined through a decentralized computation in which caches individually minimize local monetary expenditures. The second approach is applicable if the workload at a single cache is represented by an explicit request sequence and the cache employs any one of a large family of removal policies that includes LRU. The miss costs associated with individual requests may be completely arbitrary, and the cost of cache storage need only be monotonic. We use an efficient single-pass simulation algorithm to compute aggregate miss cost as a function of cache size in O(M logM) time and O(M) memory, where M is the number of requests in the workload. Because it allows us to compute arbitrarily-weighted hit rates at all cache sizes with modest computational resources, this algorithm permits us to measure cache performance with no loss of precision. The same basic algorithm also permits us to compute complete stack distance transformations in O(M logN) time and O(N) memory, where N is the number of unique items referenced. Experiments on very large reference streams show that our algorithm computes stack distances more quickly than several alternative approaches, demonstrating that it is a useful tool for measuring temporal locality in cache workloads.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Packet-Level Simulation Study of Optimal Web Proxy Cache Placement

The Web proxy cache placement problem is often formulated as a classical optimization problem: place N proxies within an internetwork so as to minimize the average user response time for retrieving Web objects. Approaches to this problem in the literature include graph theoretic approaches, combinatorial approaches, dynamic programming approaches, and vector quantization approaches. In this pap...

متن کامل

Optimal Content Placement For En-Route Web Caching1

This paper studies the optimal placement of web files for en-route web caching. It is shown that existing placement policies are all solving restricted partial problems of the file placement problem, and therefore give only sub-optimal solutions. A dynamic programming algorithm of low complexity which computes the optimal solution is presented. It is shown both analytically and experimentally t...

متن کامل

Accelerating Apache farms through ad-HOC distributed scalable objects repository

We present hoc: a fast, scalable objects repository providing programmers with a general storage module. hoc may be used to implement DSMs as well as distributed cache subsystems. hoc is composed by a set of hot-pluggable cooperating processes that may sustain close to optimal network traffic rates. We designed an hoc-based Web cache that extends the Apache Web server and remarkably improves Ap...

متن کامل

Accelerating Apache Farms Through Ad-HOC Distributed Scalable Object Repository

We present hoc: a fast, scalable object repository providing programmers with a general storage module. hoc may be used to implement DSMs as well as distributed cache subsystems. hoc is composed of a set of hot-pluggable cooperating processes that may sustain a close to optimal network traffic rate. We designed an hoc-based Web cache that extends the Apache Web server and remarkably improves Ap...

متن کامل

Pricing and Capacity Sizing for Systems with Shared Resources: Approximate Solutions and Scaling Relations

This paper considers pricing and capacity sizing decisions, in a single-class Markovian model motivated by communication and information services. The service provider is assumed to operate a finite set of processing resources that can be shared among users; however, this shared mode of operation results in a service-rate degradation. Users, in turn, are sensitive to the delay implied by the po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Communications

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2001